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SEARCH ENGINE determined according to the programmed criteria of the 

search engine, and the emphasis placed on particular types 

This application is related to U.S. Application No. Scr. ot site design, rather than any measure of the actual users' 

60/078,199 entided "Improved Search Engine" that was opinions. Indeed this can lead to the absurd situation 

filed on Mar 16 1998 5 whereby in an attempt to ensure a favorable rating by the 

most commonly used search engines, some designers delib- 

FIELD OF THE INVENTION erately configure their sites in the light of the previously 

m , , , , mentioned criteria, to the detriment of the presentation, 

The present invention relates to a method and apparatus readabilit and of tbe site 

that allows for enhanced database searching, and more 

particularly; for use as an internet search engine. 10 SUMMARY OF THE INVENTION 

„ m „ rwww n-i A ^ It is an object of the present invention to ameliorate the 

BACKGROUND OF THE RELATED ART aforementioned disadvantages of conventional search 

An efficient and practical means of obtaining relevant engines by harnessing the cerebral power of the human 

information and also screening unwanted/uninteresting 15 operator. 

information has been an ongoing need, especially since the It is a further object of the present invention to provide a 

inception of the internet. This need is particularly acute at novel search engine with enhanced efficiency, usability and 

present due to the exponential growth in the number of effectiveness with a reduced system storage and/or compu- 

world-wide web sites and the sheer volume of information tational requirements in comparison to existing software 

contained therein. In an attempt to index the information ^ engines. 

available on the internet, a number of software search It is a further object of the present invention to provide a 

engines have been created via which a user enters a search variety of indications of the popularity of the search data, 

command comprised of suitable keywords from a keyboard together with an indication of its date of creation or updat- 

at his personal computer. The search command is transmit- m g 

ted to a server computer, the has a search engine associated ^ ln order to obtain the above recited advantages of the 

with the server computer. The search engine receives the presen t invention, among others, one embodiment of tbe 

search command, and then using it scans for these key words prcS ent invention provides for a method of updating an 

through a database of web addresses and the text stored on internet search engine database with the results of a user's 

the web sites. Thereafter, the results of the scan are trans- selection of specific web page lists from the general web 

mitted from the server computer back to the user's computer x page lining provided to the user as a result of his initial 

and displayed on the screen of the user's computer. keyword search entry. By updating the database with the 

In order for the search engine to be aware of new web sites selections of many different users, the database can be 
and to update its records of existing sites, either the propri- updated to prioritize those web listings that have been 
etors of the web sites notify the search engine themselves or selected the most with respect to a given keyword, and 
the information may be obtained via a * web crawler* to 35 hereby presenting first the most popular web page listings in 
update the database at the server computer. A web crawler is a subsequent search using the same keyword search entry, 
an automated program which explores and records the j n another embodiment of the present invention, a method 
contents of a web site and its inks to other sites, thereby 0 f determining content to provide along with listings trans- 
spreading between sites in an attempt to index all the current m itted from a server computer to user sites is provided. In 
sites. 40 this emtx>diment, there is obtained a content listing from 

This database structure and method of searching it poses each one of a plurality of different developer sites. Each of 

some significant difficulties The internet growth-rate has the content listings includes content, a developer identifier, 

resulted in a substantial backlog in the scanning of new sites, and a keyword, and a keyword selection factor. Thereafter, 

notwithstanding the fact that web sites are frequently there is determined a particular keyword from the obtained 

deleted, re-addressed, updated and so forth thus leaving the 45 keywords that is the same for different content listings. For 

search engine with outdated and/or misleading information. that particular keyword, the keyword selection factor is used 

Although the web crawlers can be configured to prioritize; in determining when to transmit different content listings to 

possible key-words according to their location (title, embed-/ the user sites. 

ded link, address etc), nevertheless, depending on the type of In still another embodiment, there is provided a method of 

search engine used, substantial portions of the web site text 50 ' updating a keyword table with the results of a user's 
(often involving the majority or even all of the site text) is , selection of specific keywords which were obtained from a 
still required to be scanned. This results in colossal storage L hst of related keywords presented to the user. By updating 

requirements for the search engine. Furthermore, a typical , m e database with selections of many_different users asspci- J 
key word search may bring up an excessively large volume 4ated~w5h t same keyword, appropriate keywords can be 

of material, the majority of which may be of little interest to 55 provided and presented first when that same keyword is 

the user. The user typically makes a selection from the list ) subsequently entered. 

based on tr^ brief descriptions oLthe.site and explores the Wnp nFSTRIFTION OF THE DRAWINGS 

chosen sites until the desired information is located. BRIEF DESCRIPTION OF TIIE DRAWINGS 

These results are in the form of a list, ranked according to rrhese and olber advantages of the present invention may 

criteria specific to the search engine. These criteria may 60 be appreciated from studying the following detailed descrip- 

range from the number of occurrences of the key-words ^ on of toe preferred embodiment together with the drawings 

anywhere within the searched text, to_ methods giving a m which: 

/weighting to key-words used in particular positions (as FIG. 1 illustrates certain of the overall features of the 

previously mentioned). When multiple key-words have been ~p present invention; 

used, sites are also ranked according to the number of 65 FIG. 2 illustrates various inputs to the search, and, for 

different key-words applicable. A fundamental drawback of each often different capabilities, illustrates the outputs that 

Tall' these ranking systems is their objectivity — *hey~are will be provided engine according to the present invention; 



us 6,4: 
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FIGS. 3A and 3B illustrates an overview of the process by 
which web pages are selected in making up the search results 
provided to the end user accord to the present invention; 

FIG. 4 illustrates the data sets used for different web-page 
searches according to the present invention. 

FIG. 5 shows e various data sets previously describe and 
various inputs and actions that result in a list of suggested 
web pages being provided according to the present inven- 
tion; * 

FIG. 6 illustrates the implementation of a popular search 
according to the present invention: 

FIG. 7 illustrates the implementation of a hot off the press 
search according to the present invention: 

FIG. 8 illustrates the implementation of a high-flyers 
search according to the present invention: 

FIG. 9 illustrates the implementation of a random search 
according to the present invention: 

FIG. 10 illustrates the implementation of a previous past 
favorites search according to the present invention. 

FIG. 11 illustrates the implementation of a collective 
search according to the present invention. 

FIG. 12 illustrates the implementation of a date created 
search according to the present invention. 

FIG. 13 illustrates the implementation of a customized 
search according to the present invention. 

FIG. 14 illustrates the implementation searching based 
upon a group identity according to the present invention. 

FIG. 15 illustrates a keyword eliminator feature according 
to the present invention. 

FIG. 16 illustrates the process of determining which 
search results should be used to make up the cumulative 
surfer trace table according to the present invention. 

FIG. 17 instates active suggestion of web pages according 
to the present invention. 

FIG. 18 illustrates passive suggestion of web pages 
according to the present invention. 

FIG. 19 provides an overview of suggesting keywords 
according to the present invention. 

FIG. 20 illustrates the manner of creating data sets for 
suggested keywords according to the present invention. 

FIG. 21 illustrates a variety of manners in which a list of 
suggested keywords can be created according to the present 
invention. 

FIG. 22 illustrates how content is attached to web page 
listings according to the present invention. 

FIG. 23 illustrates various content data sets and opera- 
tions that populate them according to the present invention. 

FIG. 24 illustrates various content data sets and opera- 
tions that are used to select data from them a according to the 
present invention. 

FIG. 25 illustrates web page listings and other content 
data according to the present invention. 

DETAILED DESCRIPTION OF THE 
PREFERRED EMBODIMENT 

FIGS. 1A and IB illustrate certain of the overall features 
of the present invention, which will be described in further 
detail hereinafter. It is initially noted that like numbered 
reference numerals in various Figures and descriptions will 
be used in the following descriptions to refer to the same or 
similar structures, actions or process steps. 

The present invention is preferably implemented in a 
network environment wherein each computer contains, 
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typically, a microprocessor, memory and modem, a certain 
of the computers contain displays and the like, as are well 
known. As shown in FIG. IB, a plurality of user sites/ 
computers 100A-100D are shown, as are a plurality of 

5 server computers 102A-B, and developer sites/computers 
104A-B. It is understood that in a typical internet network, 
that different server computers 102 can be interconnected 
together, as is illustrated. Further, while only a few user 
sites, developer sites and server computers are shown, it is 

10 understood that thousands of such computers are intercon- 
nected together. 

While the specific embodiments of tee present invention 
arc written for applications in which the invention is imple- 
mented as sequences of coded program instructions operated 

15 upon by a server computer 102 as illustrated, it will be 
understood that certain sequences of these program instruc- 
tions could instead be implemented in other forms, such as 
processors having specific instructions specifically tailored 
for the applications described hereinafter. 

20 As will be illustrated hereinafter, additional operations, 
transparent to the user, are implemented in order to obtain 
search results in the future based upon currently made 
searches. As shown, the present invention has various 
capabilities, each of which are illustrated in a parallel flow 

25 in FIG. 1A, which illustrates an overview of the different 
capabilities that can be ongoing simultaneously. In terms of 
overall capabilities, start block 10 show three: tsuggesting 
web pages 12, suggesting keywords 14, and content sug- 
gestion 16. " 

"~ In order for web pages 12 to be selected by a user/ 
according to the present invention; mc^^a^tep48 in which ; 
the type of search to be performed is selected. Thcreaftcr^m 
step 20,, search input obtained from one of a variety of 
3S csources is input and used along with the-algqrithm selected 
in step 18 to determine search results. The results of this 
search are then displayed to the user, as shown by steps of 
displaying a created list of web pages/ displaying passively 
suggested web pages, and displaying actively suggested web / 
^ /pages, identified as steps 22, 24, and 26, respectively, in FIG: 
( 1. This capability, and how it is implemented, will be 
described in more detail hereinafter. 

In orde? for keyword suggestion to take place, which the 
user may or may not select, there is preferably an initial step 
45 ^28 7 in which the type of keyword search olgorithm to use is 
selected. Although many systems may have only one such 
algorithm, various ones, as described hereinafter, are pos- 
sible. Once the keyword search olgorithm is selected, step 
^30 follows in which, based upon a keyword entered by a 
50 user, the current set of keyword data is operated upon to 
determine associated keywords. The results of this operation 
arc then displayed to the user in 30. This capability, and how 
^it is implemented, will be described in more detail herein- 
after. 

55 The previously mentioned web page and keyword selec- 
tion capabilities inured to the direct benefit of the end user. 
Another novel feature of the present invention, which indi- 
rectly inures to' the benefit of the end user, directly benefits 
the advertiser, because it allows for content to be targeted in 

60 real time based upon various criteria. As will be described 
more fully hereinafter, a content providing olgorithm is 
initially selected which will determine how content is 
selected in step 34. Step 36 follows, and based upon inputs 
from users and content providers, which content to show is 

65 determined. Thereafter, the advertisements are displayed for 
/ the user to see, simultaneously with the display of either 
keywords and/or web pages > 
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While FIG. 1 illustrates certain overall features according appropriate, the structure of the dependent data-sets used to 

to the present invention, many of the advantageous features create the defined data-set are shown in tabular form) with 

of the present invention are not, as mentioned previously, reference to the preferred embodiment of the present inven- 

observable to the user, but instead transparent to user. They tion. Thereafter, certain of these will be explained in even 

are, however, significant in order to fully explain how the 5 greater detail to fully teach how to make and use the present 

present invention is implemented and are explained herein- invention. 

after. Locations: a plurality of unique information entities. 

FIG. 2 is provided to illustrate various inputs to the search Web-pages: Locations in the form of Web-pages URL 

engine according to the present invention, and, for different Universal Reference Locator) addresses, 

capabilities, illustrates the outputs that will be provided. 10 Key-word: Hie word or phrase that is entered in the 

More detailed explanations are provided hereinafter. Data search engine 

that is potentially input from search engine user include: /Hit-list: The list of web-pages URL addresses) that is the 

keyword 52 — this is the word or phrase that the user eater result G f the key-word search. This hirtist ranks the relr 

to find a list of web pages evance of the web-pages relative to the key-word. This 

profile types 54 — these are the groups of people they bit-list always has a key-word associated with it: / 

associate them selves with e.g. US, male, doctor etc. 
user ID 56 — this is a unique identification for each user 

that chooses to register with the search engine. This can 

be done via a cookie or logon. 20 
search type 58 — this can be actively chosen by the 

searcher to determine the type of search results they 

would like (popular, new, etc) 

date-time 60 — this is passively recorded when a searcher 

uses the system 25 Permanent data set: Retained long tern (although it changes 

IP address 62— this is passively recorded when a searcher overtime) 

uses the system Temporary data set: Created only for the duration of the 

search 



Input data set 


Output data set 


Key-word (temporary) 


Hit-list - Ranked hit-list of 


Database to match the key-word with 


Web-pages (temporary) 


(permanent) 





Surfer trace: This is a measure of bow users search. It is a 



other 64 — this includes other personalization information 

such as search customization preferences, keywords for ^ ^ t t , „ , , T _ r 

web page suggestion etc. This information is entered 30 trace of ke X words the y ^ rch for - «■* URLs sabsG ~ 

actively once by the user then used to personalize the 1 uentl y selected and how long they spend there, from 

search results each time the users (identified by user which a ranking of web-pages for a users (surfers) can be 

ID) uses the search engine. calculated. It is a measure of which web-pages they found 

Data from web-page developers include: most useful after the key-word search. The combination 

URL 66— this is the URL address of the web page or 35 of aI1 surfer traces is used to create a users' choice hit-list, 
i pages that they wish to submit - - v 

description 68 — this is a 2-3 line description of the 

L information on their web-page input data set Output data set 

keywords 70 — these are the keywords that the web page 40 

developer WOUld like to associate their Web-page with' ? ^y-wonl(temporary) Surfer trace - A list of user 

*^ . s User selections from initial search web-pages users found useful for 

target audience 72 — these are the target audience (profile results (temporary), Le. Web pages each key-word (can be permanent or 

types 54) that the web page developer particularly want visited (URLs) temporary) 

to target. ^fETV ^ ^ 

IP address of user 

date-time 74 — this is passively recorded when every a 45 

web-page developer submits a web page 

Data from content providers include?^;, Users ' choice ti^t: This a semi-permanent ranking of 

- biaV7<rthese arc $ bids for content as described later.^ web-pages associated with every key-word and indicates 

.1 - A . . . , 1 « 1 .1 '"*■"* ' ? how useful Internet users found each of the web-pages 

content details 78-^his includes all details of content/ Q assodllted ^ te key _^, old ^ choice K tVU 

„ Providers including address, content details etc. fc u ^ ated b a new traoc . 
v Results from other search engines 80 — these are the 
results for a keyword search from other existing search 
engines. 

Outputs of the search engine 10 are: 55 input data set Output data set 

lists of web pages 90 — depending on the input data a list / " " ™ ~ 7~! 777"" T~T~! 

< _ , r 0 . r . ~ . , ~ Surfer trace (can be permanent or New Users choice hit-list - Ranked 

of web pages can be produced in web page determi- temporary) hit-list of "popular Web-pages 

nation Step 82, described further hereafter;^ Users* choice hit-list (permanent)* (permanent) 



/content keywords 92 — the search engine suggests other , 

/, j c 4 . , 1-1 j j . The initial users choice hit-list will be the surfer trace. 

keywords for users to try produced m key word deter- $o 

mination step 84, described further hereinafter; and > New web-page list: This is a list of new web-pages that is J 

content 94 — the search engine sends out selected content .J created by URL submissions from web-page developers, 

as produced in determine content step 86, described - } ^When a web developer updates a web-page,^^y c,an 

further hereinafter ^submit the web-page address, brief infoimation about the 

To facilitate ease of reference and aid understanding, the 65 page and a list of key-wordsthat'the developer decides are ^ 

aforementioned and subsequently mentioned data-set defi- ie je^ ant^rhc web?page is--tr^n^rMacgdcon ^the,Jqp of each 

nitions are reiterated and expanded upon below (and where c of the key-word^new web-pagejists. _/ 
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of key- words rather than Web-page The method for doing 
this uses a similar olgorithm to a spell-checker only it 



input data set Output data set docs it for phrases. It also suggest Key-words, based on 
previous URL selections from sequences of user key- 
All wcb-page developers information New web-page list (permanent) words 
about web address and key-words 



Content Provider's list: This is a list (associated with each ; 
key-word) of content providers which must typically > 
[that] pay to illustrate content with the key-word. The^io 
price-paid is dependent on the number of other content' / 
-providers, the amount they spend and the number of times - ; 1 
the key word is searched for. 



Input data set 



Output data set 



Key- word (temporary) _ / Ranked hit-List of other key-words the 
Database of all key-words used / \ user may want to try_ (temporary) 
(permanent) 7 - — v 



Input data set 



Output data set 



Key-word ' Content Providers list - a list of J 

Content Provider's bids for content' / content associated with each ^ 
' spots , key-word permanent) 



High-flyers hit-list: This a list of web-pages (associated with 
every keyword) that axe increasing in popularity at the 
highest rate. It is an indication of how rapidly web-pages 
are rising up the users' choice hit-list and it is used as a 
means to ensure that new emerging web-pages rise to the 
top of the users* choice hit-list. 



Surfer keyword list: This is a data set comprised a list of 
15 key-words that the individual user found useful after the 
key-word was selected. This is found by tracking which 
key-words the user decided to use. This is equivalent to 
the surfer trace. 



Input data set 


Output data set 


Old Users' choice hit-list - 


High-flyers hit-list: A ranked list of 


(temporary) 


web-pages that are rising in popularity 


New Users' choice hit-list - 


the fastest 


(permanent) 





Input data set 


Output data set 


Key-word 


Personal hit-list: A ranked list of 


Individual surfer trace - (permane 


nt) web-pages that an individual has 


found useful in the past 



Collective Search hit-lists. This can be a combination of any 
of the above hit-lists. There are many different ways a 
these hit-lists can be combined. 



Input data set 



Output data set 



Crawler hit-list (temporary) 



Collective Search hit-lists - (Default)' 



Users' choice hit-list (permanent) /Ranked hit-list of Web-pages/ 
Advertisers' list (permanent) £ displayed to the user after the key : j 
New web-page list (permanent) ^word search. It can be a combination 
High-flyers list (permanent) j of any of the hit-lists above f 

Personal hit-list (permanent) * (temporary) 



^Crawler key-word list: This is a list of key-word suggestions 
~ mat the user may find useful. This is found by matching 
the key-word entered by the user to the database of 
key-words and phrases that other users have tied. This is 
the equivalent of the crawler hit-list, though it is a ranking 



20 



Input data set 



Output data set 



25 



30 



35 



Key-word (temporary) Ranked list of other key-words 

Data about what key words were used (associated with the key-word) that 
from the key-word sugg ester this individual user found useful 

(semi-permanent) 



key-word suggesten This is a data set consisting of a 
permanent ranking of other key-words that users have 
found useful, compiled from successive surfer key-word 
lists and is linked to each key-word (this is equivalent of 
the users' choice bit-list). 



Personal hit-list: This a list of web-pages the individual user 
has found most useful for each key-word search they have 
done in the past It is like an automatic book-marking data 
set for each individual user. 



Input data set 


Output data set 


Surfer key-word list (temp or 


New users* choice key-word list 


permanent) 


(permanent) 


Hxi$ti"g users' choice bit list 




(permanent) 





User Based Search Algorithm 

The discussion provided above provides the language 
necessary to more fully describe the present invention As 
illustrated in FIGS. 3A and 3B, which provide an overview 
45 of the search engine capabilities according to the present 
invention in which web pages are selected in making up the 
search results provided to the end user. In step 112, the user 
enters up to 4 sets of data; keyword 52, profile type 54, 
search type 58 and User ID 56. The IP address 62 and 
50 date-time 60 are not entered by the user but can be read when 
a user uses the search e . This data is used is used in parallel 
in steps 114 and 116 to produce list of web pages. Step 114, 
discussed in detail hereinafter, is the process of selecting 
web pages from novel new search engine data sets produced 
55 in accordance with the present invention. This can run, if 
desired, in parallel with step 116 which obtains a selection 
of web pages from other existing search engines. Thereafter, 
selection of web pages from step 114 and 116 are combined 
and tagged in step 118. The process of tagging the list of web 
60 pages, describe In more detail below, enables a set of data, 
shown as surer trace data in FIG. 3, to be created and sent 
back to the search engine when the search engine user 
selects awe-page from the list in step 120. The process of 
selecting a tagged web-page creates the following series of 
65 data which is used to update the search engine data sets; 
keyword 124, URL 126, user ID 128, IP address 130, 



/date^meji!§2^^ 



US 6,421, 

9 

Although it is preferred to use all of these different data 
types in the surfer the data, use of different combinations of 
this data is filly within the intended scope of the present 
invention The description 134 will typically only be 
included in the preferred embodiment of the invention when 5 
anew site is added to the data set 114 of the search engine 
10, and the description used will be that description that 
appears on the original list of web pages. The date-time data 
132 may only indicate that a site was selected, rather tm 
record the period of time a user was at a particular site, as 10 
explained further hereinafter. This process is invisible to the 
user who, upon selecting the web-page from the list of web 
pages is taken directly to the corresponding URL, step 122. 
Details of the implementation of steps 114, 118 and 120 will 
be described in more detail hereinafter. 15 

After the initial selection the user may choose to access 
another of the web-page URL search results. Depending on 
the relevance of the site, the user may spend time reading, 
downloading, exploring further pages, embedded links and 
so forth, or if the site appears irrelevant/uninteresting, the 20 
user may return directly back to the search results after a 
short period. The time difference between the two selections 
is recorded as the difference between two date/time data 132 
from subsequent selections from the list of web page 
searches (in this embodiment one can only measure the time 25 
spent at one web page if another selection is made after 
visiting that web page — this then provides another surfer 
trace 132 which allow a time difference to be calculated). 
This surfer trace data on the popularity of web pages is used 
to the subsequent searches, as described further hereinafter. 30 

Thus, according to the present invention, it is the human 
users' powers of reasoning and analysis that is beg used to 
establish the relevance of the different results to the subject 
matter of the search. The present invention utilizes tie 
cumulative processing and reasoning of all the human users* 35 
to provide a vastly more effective means of obtaining the 
required information sources than is presently possible with 
the type of method described above 

As described above, human brain power is captured by 
recording which web pages the user goes to after each 40 
keyword search. According to the present invention, collect- 
ing the surfer trace data is achieved by sending, in the list of 
web pages generated by the search to the user, hidden links 
that will automatically send information back to the search 
engine (or a subsidiary server). While the user only sees that 45 
his intended link is displayed, the hidden link notifies the 
search engine of the transfer, which process can be executed 
with a Java applet. Thus, when the Internet user selects a 
web-page it takes the user to that address but also sends off 
the surfer trace data to the search engine 10, which notes 50 
what has been selected When the user returns to the list of 
web pages and selects another web page listing, another Java 
applet is then executed which creates another surfer trace. 
The difference between the data time data in this surfer trace 
from two sequential selections captures the time period that 55 
the user has been at the previous web site. This occurs 
without the use knowing this data is being sent. 

In another embodiment, rather than using multiple Java 
applets to collect a complete list of surfer trace data, there is 
no description data 134, and the date-time data 132 indicates 60 
that a user visited a particular web site. In one specific 
embodiment, the user must visit a particular web site for 
greater than a predetermined period of time, such as one 
minute or fifteen minutes, depending on what is an appro- 
priate time to have looked at the site for the visit to the site 65 
to count and for any surfer trace data to be sent back to the 
search engine 10, as well be described hereinafter. In this 
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embodiment, each applet contains all of the information 
necessary to update the database at the search engine. 
Another embodiment collects the surfer trace data prior to a 
user navigating to the intended web site. Other ways of 
obtaining this surfer trace data are possible and are within 
the intended scope of the present invention. 

Thus, the search results page according to the present 
invention is therefore differently formatted from conven- 
tional search engines* results pages. The difference is in 
action rather than content. Visually, the page looks the same 
to the user as standard search results from other search 
engines. An example illustrates this point: In a conventional 
search the results page for a search of the keyword 
"Weather** may read: 1. www.weather.com Today's weather 
forecast. Today is expected to be fine ad sunny everywhere. 

The HTTP link associated with the "www.weather.com** 
label is "httprl/wwwweather.com**. This means that if the 
user selects this link, they will navigate to this page directly 

In contrast, according to the present invention, the tagged 
result page for the search made suing the keyword 
"Weather** may read 1. www. weather com Today's weather 
forecast. Today is expected to be fine and sunny everywhere. 

The HTTP link associated with the "www.weather.com" 
label is link.asp?n=l." If the user selects this link, therefore, 
in a process is invisible to the user, the user is first directed 
to the link asp page on the site corresponding to the web 
server using the search engine 10 according to the present 
invention, and pass parameter n with value 1. 

Server side code (application code that runs on the web 
server) uses this parameter to identify Me URL and descrip- 
tion of the user's chosen site, This information is then stored 
in a database Table along wit other surfer trace data. The 
server side code then executes a redirect operation to the 
user's required URL. The user then sees their required page 
appear. 

The source of search results is independent to this activity. 
The destination page of the user is independent of this 
activity. The process is one of recording a user keyword and 
destination into a database. This method of tracking can only 
record the initial web-page visited after a keyword search. If 
the user continues to return to the search results list then 
subsequent web-page visits can be recorded. 

The surfer trace data that is sent back to the data sets 114 
of the search engine 10 as a result of the user selecting the 
we-page can be encrypted to prevent fraudulent users from 
sending fake data to the search engine. 

Another method of tracking where a user may connect to 
from an initial URL selection (if they do not return to the 
search result page) is to run the selected web-pages as part 
of a 'frame' located at the search engine web-site. This 
permits a complete record of the web pages visited to be 
recorded after a keyword is entered. However, this imposes 
an additional level of complexity to the system with a 
possible decrease in system response time. 

As previously mentioned, the surfer trace data that can be 
collected includes keyword 124, URL 126, user ID 128, IP 
address 130, date-time 132, brief web page description 134, 
and is identified as such since it provides a trace or record 
of how searchers (surfers) use the search engine. This data 
is used to improve future searches building on the prefer- 
ences of previous searchers. The surfer the is thus a measure 
of the preferred choices of an individual user or web 
'surfers' from the initial search results for a particular set of 
key-words. 

How the Data Sets are Created that Determine the List of 
Web Pages 

FIG. 4 illustrates the data sets used for different web-page 
searches according to the present invention. The data sets 
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(tables) that are used to determine the list if web pages 
include keyword table 164, profile ID table 166, security 
tabic 168, cumulative surfer trace table 170, keyword URL 
link table 172, personal link table 174, and web-page (URL) 
table 188. 

The structure of the aforementioned data sets are 
described in more detail hereinafter. The descriptions that 
follow show the data arranged in a spreadsheet fashion, with 
multiple values per cell and many blank cells. Illustration in 
this manner is convenient for explaining the present 
invention, but is not an efficient storage and retrieval 
method. As will be apparent to those skilled in the art, a 
/ relational database model would be used to implement the 
~ data storage according to the, present invention such that 
there may be multiple fields or Tables involved to store the 
data and each field will store only one value. 
Keyword Table (164) 

The contents of keyword data table 164 of FIG. 4 are 
shown in more detail in Table 1 shown below, and is a list 
of keywords ineluding.phrases,,and ^^^m^mofMmtsi\i6y/ 
; have been requested. If the list becomes unmanageably 
' large, the key-words that are not used again after a prede- 
termined time period could be deleted from the list. How- 
ever is would be desirable to keep the majority or all 
keyword phrases that are entered, if possible. 

TABLE 1 

List of information requests and the number of times it is requests 



10 



20 



25 



30 



Key-word 



Cumulative number of times the key- 
word is requested (W) 



Unique number for 
each key-word 



Key-word 1 
Key-word 2 
Key-word 3 
Key-word 4 
Key-word 5 
Key-word 6 
Key-word 7 



Wl , W2, W3 etc 



35 



40 



The cumulative number of times a keyword is requested 
may be segregated according to the different "users profiles" 
selected (Wl, W2, W3, . . . ),e.g. Wl=total searches, 
W2=male profile, W3=Female profile, W4=USA profile and 
so forth. It should be noted that the sum of W's will be 
greater that the total number of times a site has been visited 
because the user may fall into more than one profile category 
e.g. a male-(W2) from the USA(W3). This would become 
a list of not only the number of user searchers using that 
key-word but also a list of the type of user (according to the 
profile type selected) searching for that keyword. Keywords 
that mean the same thing in different languages are different 
keywords, as long as the spelling is different, although they 
could be related using the keyword suggester, as described 
hereinafter. 
Web-page Table (188) 

The contents of web-page table 188 of FIG. 4 are shown 
in more detail in Table 2 shown below, is and contains a list 
of Internet web-pages. Each web-page has a URL address,^60 
an associated 2-3 line description, a unique web page 
number for each URL(which can also be any character, 
symbol code or representation) and the cumulative number 
of times the URL has been visited The URL address will 
have a unique number (which can also be any character, 
symbol code or representation) assigned to it rather than 
storing the full URL string in the subsequent data-Tables. 



45 



50 



55 
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TABLE 2 



List of Information suppliers and a description of the web-page 

Unique number 
2-3 line for each Frequency the URL 

Address description URL address (web page) is visited 

URL address 1 
URL address 2 
URL address 3 
URL address 4 
URL address 5 
URL address 6 
URL address 7 ... 



Keyword URL Link Table (172) 

The contents of keyword URL lilt table 172 of FIG. 4 are 
shown in more detail in Table 3 shown below. This table is 
of particular significance with respect to the present inven- 
tion because it contain information about the inks between 
information supplies (URL addresses or web pages) and 
information requests (Keywords). 

This data is recorded in further data sets which describes 
the relationship between the Key-words and occurrences as 
defined by the following three parameters. 

the cumulative number of significant visits (hits) to each 
^yf URL addresses corresponding to each key-word (herein 
preferred to as X or weighting factor X). This is a 
^mcasurc^of thepopularity±of the URL for each keyword, 
^d is determine from the surfer, traces^ 
the previous cumulative number of significant visits mea- 
sured at an earlier predetermined instant; (herein 
referred to as Y or weighting factor Y) 
a date time factor relating to the instant of the creation or 
input of each said web-page(herein referred to as Z or 
weighting factor Z). Z is the data time in which a 
web-page developer submitted a web-page to the 
search engine. 

Not all combinations of key-words and URL addresses 
will have data for X, Y and Z. 

TABLE 3 

Links between information suppliers (web-pages) and information requests 
(key-words') 

Key-word Key-word Key-word Key-word Key-word 



URL 


X,Y, Z 


address 1 




URL 


X,Y, Z 


address 2 




URL 


X, Y, Z 


address 3 




URL 


X, Y, Z 


address 4 




URL 


X, Y, Z X, Y, Z 


address 5 




URL 




address 6 




URL 




address 7 





65 



Profile Type s with the Keyword URL Link Table 

The popularity of web pages will be different for different 
groups of people. The inclusion of multiple profile type s 
will produce multiple values of X Y and Z in Table 3, e.g. 
one may have a Global and New Zealand popularity rating 
denoted by XI X2 Yl Y2 etc. 
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URL address relating to Rugby XI = 520, X2 - 52 

URL address relating to Basketball XI ° 4000 X2 - 20 



In his example the global popularity (using the general 
profile type ) for the Rugby and Basketball URL addresses 
are 520 and 4000 respectively and 52 and 20 respectively for 
the New Zealand profile type. 

When the general profile type setting is used (ranked 
based on XI), the Basketball site would be ranked at the top. 
When the New Zealand setting is chosen (ranked based on 
X:2) the rugby site would be highest. This would be a 
reflection of the preferences of the New Zealanders. This is 
a very simple method of storing the preference of different 
groups of people. One would expect New Zealand-based 
rugby web-sites to rate higher than an overseas site on the 
New Zealand list, but there is no reason that this has to be 
the case. Someone in Spain may have the best Rugby site in 
the world. The system evaluates web-pages only on the 
perceived quality of information by the users — the physical 
location of the site is immaterial. 

There could be a vast range of X values representing 
different countries, occupations, sex, age and so forth, 
enabling, the popularity of different groups to be captured 
very simply. Users could choose to combine any of the X 
values according to their personal interests/chacterisics. 

As an example, if say, 

XI is for males 

X2 is for females 

X3 is for New Zealanders 

X4 is for USA 

X5 is for engineers 

X6 is or lawyers . . . 

A "male" and a "New Zealander" would using the search 
engine increment both X3 and XI. This facility would 
increase the data requirement of the system but it could 
vastly improve the search results for different users. The 
total popularity of the web-page needs to be stored as a 
separate number as users may contribute to more than one of 
the groups of people. The sum of all of the individual 
popularity's would be greater than the total popularity 
because user can belong to more than one profile type. 

To simplify the system for the user there would be a 
default profile type (selection of X's) with an option is to use 
other profile type s to do specific searches. For example, a 
user may have a default profile type of a New Zealand male, 
but if a technical search is required a "global engineers" 
profile type may be chosen that reflects the cumulative 
search knowledge of engineers around the world. 

The extent of personalization could be dependent on the 
frequency of searching. For example, common keywords 
such as "news" would have a high degree of personalization 
(a large range of X values) and less common key-word such 
as "English stamps'* would have little or no personalization 
(only a global X value). The degree of personalization could 
be a function of the frequency that the key-word is used 
(found from Table 1). 
Cumulative Surfer Trace Table (170) 

The contents of cumulative surfer trace table 170 of FIG. 
4 are shown in more detail in Table 4 shown below. 
Information about the links between web pages and key- 
words in Table 3 ( also referred to as keyword URL link table 
172) is updated by the surfer trace data. The cumulative 
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surfer trace is the combined information from all individual 
surfer traces and it is used to determine how many "hits" 
(significant visits) each web-page had for each key-word. 
The information collected from each individual surfer 
5 trace is a series of inputs previously described, and shown 
below in Table form 

I ABLE 4 



Each row is one surfer trace and the combined rows are the cumulative 
surfer trace 

IP Number User ID Keyword URL (webpagc) Date-time 



The way the surfer trace data is processed to update Table 
15 3 is described further hereinafter. 
Profile ID Table (166) 

The contents of profile ID table 166 of FIG. 4 are shown 
in more detail in Table 5 shown below. This table includes 
a unique identification, password, contact email and a 
20 default profile type which they normally use to perform their 
searches. 



TABLE 5 



25 





User identification Table 




User 


Default 


Other 


identification password 


email profile 


information 


Joe Bloggs dogs 


jbIoggs@AOL US, Male 





30 



The users default profile type is stored as the part of the 
user's personal preferences profile, which would accessed 
by entering some form of personal identification to the 
system. This information could be supplied when logging on 

35 to the data search engine or the search engine could leave a 
"cookie", as that term is known in the art, on the computer 
to identify a user, (there would be an optional e-mail address 
and password (or similar) associated with the logon 
procedure). The IP address itself would not be a sufficient 

40 means of identification as it is not necessarily unique to the 
individual users. 

The other information can include user defined prefer- 
ences for how the search results are combined and keywords 
that are of particular interest to the user. This information 

45 can be used to actively customize the search results and 
suggestions of web pages to visit. 
Personal link Table (174) 

The contents of personal link table 174 of FIG. 4 are 
shown in more detail in Table 6 shown below. Table 6 is 

50 identical in structure as Table 3, and can be used to record 
a users personal preferences relating to each URL including 
the number of times visited and the key-words. In this Table 
6, however, Z is not the date that the web-page developer 
submitted the web-page by it is the date-time that the user 

55 visited the web page. This allow the users could refine a 
search by defining the last time they visited the web page. 

TABLE 6 

gQ Links between information suppliers (web-pages) and information requests 
(key-words') for an individual user 

Key-word Key-word Key-word Key-word Key-word 

URL x,y,z 
address 1 

65 URL x,y,z 
address 2 
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in Step 176, as shown in FIG. 4, web-page developers can 
TABLE 6-contimied <L submit a URL 187 ad description 68 which also ; goes: 
directly into the web-page data table 188, ~ 

links between information suppliers (web-pages) and information requests ^web Crawlers may also add XJRL addresses and descrip- 

0"Y-words)f O ran individual user $ ^ ( me description is either the first few lines of the 

Key-word Key-word Key-word Key-word Key-word < ^web-page Of in the HTMLcoded "title"). This is not an 

^ essential element of the system but it could be a method 

URL x,y,z to obtain URL's and descriptions. With this search , 
address 3 system web- crawlers. are _ more likely to be- used to 
x ' y,z * 10 < ver ify me information rather than find new information^ 
url j^yjt Populating the Cumulative Surfer Trace Table 170 "~" 
address 5 The cumulative surfer ace table 170, also referred to 
URL above as Table 4, is populated each time a "tagged" web- 
address 6 pa g e ^ selected by a user. This sends a packet of surfer trace 
? information, such that the surfer trace data is added to the 

a rcs& 15 table each time the user selects another web page from a web 

page list. 

The data in Table 6 is only accessed by the individual that Populating the Keyword URL Link Table 172 

created it, and accessible using a user ID that is preferably The data from the cumulative surfer trace 170 is used to 

independent of changes in the user's e-mail or IP address update the popularity of web pages as recorded in Table 3 

changes and would thus enable their past personal prefer- 20 (X, Y), also referred to as the keyword URL link table 172. 

ences to be retained during such changes. Th e 

This Table 6 data set could be stored either at the search frequency of updating Table 3 with the data from the 

engine site or on an individual's computer. Storing on local cumulative surfer race (170) to obtain new values of X and 

PC's would require additional software to be installed on the Y ^ a variable that can be changed, from ranges that are 

users computer. There are numerous advantages to storing 25 shorter ^an every hour to longer than every ^month. It should 

the information at the search engine including the fact that * ^ "^t different keywords can be updated at different 

i*i i . 4 . j i-i i * i_ intervals of time, 

users are likely to go mere more often and unlikely to change A • * j- * .u i *- _r 

. • .i_ i_ i_ . ^ 1 1_ i j * • An intermediary step in processing the cumulative surfer 

search engines once ^they have a substanUal book mark list. ^ ^ to fofm surfer hit table. This is 

Security Table (168) subsequently used to modify the values of and X, Y in Table 

The contents of security table 168 of FIG. 4 are shown in M 3 

more detail in Table 7 shown below. To ensure that users do As mentioned above, the simplest method of recording a 

not submit the same key-word over and over to increase its link ("useful visitor "hit") between a keyword and a URL 

popularity the following security data table is used. Each would be to count each keyword, URL paring in a surfer 

entry is a single piece of information ix. yes or do. This table trace as a "hit". A more meaningful and sophisticated 

can be created for links between keywords and IP addresses 35 method is only to count a location selection as a valid if the 

or links between keywords and User ID's. user meets certain criteria. This criterion could be the user 

exceeding a specified time at a location. If this criterion was 
TABLE 7 not met » ti* e selection would not be increase the cumulative 
value of X in Table 3. 

Security Table to ensure one computer user does not submit keywords to 40 It is also possible to increment the value of X based on the 

artificially boost the popularity of a web-page time spent at the web page. The longer the time spent the 

^ , ., _ „ v 1 ~ v , A more this increments the value of X. X does not have to be 

Key-word 1 Key-word 2 Key-word 3 Key-word 4 ... 

a whole number. 

IP address i l Due to &he variations in web-site capabilities in terms of 

IP addresl 3 3 45 *°£~° n ^ mes » down loading times, bandwidth, and response 

IP address 4 l times, the predetermined time used to denote a valid 'hit* 

IP address 5 a may be suitably altered. Specialist web crawlers may be 

employed to independently validate such data. 

^^j..- . . , , The selection of a content provider's banner after a 

Described heremafter are the processes that are used by 5Q fd ^ a ^ for ^ web 

the present invention to populate each of the FIG. 4 tables (increment ^ value of ^ ^ ^ enable their web pages 

mentioned previously. to me popularity list associated with the 

Populating the Keyword Table 164 ' keyword This acts as a mechanism to enable a web-page 

This table is populated every time a user enters a keyword developer to pay to be seen with a keyword: They can not 

52 to the search engine. A submitted keyword I is compared ^ to g0 up me popuUliiy Hst-this will only occur if 

to the keyword list in Table 1 (keyword table 164) and added - le ^ their sjle ^ spasd ume there ^ a valjd 

if it is not already present. If it is present, the cumulative hit for the list ^ yalues of a contem hU ^ yary 

number is increased by one. If the user has a profile type then ( if could be 1 or 0.5 or 7) depending on the emphasis one 

the cumulative number for the ^yword for each type of wants t0 lace how much mat content affects the popularity 

profile will also be incremented (W1,W2 W3 etc). ^ rankmg 

Populating the Web-pagc Data Table (URL Table) 188 cumulative surfer trace information can be processed 

This table is populated in a number of ways, including: m a large number of ways to populate Table 8 (below), 

user selecting a URL address 126 that is not already in / Grouping the cumulative surfer trace according to the IP 

Table 2 (URL table 188). The URL address 126 and addresses or user ID produces the search pattern for an 

^description 134 are put directly into the web-page data 65 individual users. This is a list of key-words and URLs and 

table 188. The new URL is assigned a unique identi- times. This allow the time spent at each web-page to be 

fication number. calculated for each user (it is not possible to calculated the 
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Lime spent at the last web pages of a search session as there 
is no time record after they go to that web page) 

If the time between each visit is longer than a certain time 
period, one is added to the cumulative surfer hit (a) 
table for the key-word URL. (this is the simplest 
method, methods in which relevancy is proportional to 
the time spent at the site, for example, are also properly 
within the scope of the present invention). 

TABLE 8 

cumulative surfer hit table created from accumulated surfer traces 

Key-word Key-word Key-word Key-word 

URL address 1 

URL address 2 a a 

URL address 3 a a 

URL address 4 a 

URL address 5 

URL address 6 a 

URL address 7 a 

The cumulative surfer hit is used to update the value X in 
Table 3 in the following way X (nw) =(X (oW> . HF)+a. 

HF is the history factor which is a umber between 0 and 
1. The history factor does not have to be the same for every 
key-word and could be varied depending on the rate at which 
the keyword is used 

The data collected for Table 9 is used to recalculate the 
values of X in Table 3 after a predetermined time period. The 
frequency of updating Table 3 will influence the value of the 
History factor (HF) chosen. The reason for multiplying the 
existing X by a "history facto** is so that the perceived 
popularity does not last indefinitely. The history factor 
reduces the weighting attached to the past popularity. To 
illustrate by way of an example, the key-word "sports news" 
may have an existing popularity with the following ranking 
(based on the number of hits per web-page, X) 



1 Winter Olympics web-page 

2 Soccer results web-page 

3 Baseball results web-page 

4 Golf news web-page 



X - 19000 
X - 18000 
X - 15000 
X - 15000 



The cumulative surfer hit Table for a week may be: 



1. Winter Olympics web-page a - 500 

2. Soccer results web-page a - 1800 

3. Baseball results web-page a = 1500 

4. Golf news web-page a - 4600 



The reason for the change in the number of hits reflects 
the fact that the winter Olympics has finished and the Master 
golf tournament has stated. If one has a "history factor** of 
0.9 then the new popularity (X) will be: 



10 



15 



20 



30 



45 



50 



1 Golf news web-page 

2 Soccer results web-page 

3 Winter Olympics web-page 

4 Baseball results web-page 



18100 (0.9 x 15000 + 4600) 
18000 (0.9 x 18000 + 1800) 
17600 (0.9* 19000 + 500) 
15000 (0.9 x 15000 + 1500) 



Thus, the more popular web-pages can emerge and the 
less popular decline, reflecting the fluctuation of interest 
over time in different subjects and events. 

The database is therefor utilizing the human mind to 
provide a powerful indication of what people find useful on 
the Internet. The users themselves replace a substantial 
computation requirement that would otherwise be required 
to filter through such searches. 

The value of Y in Table 3 is the old value of X, and the 
value of Y will be updated at intervals that are deemed 
appropriate, which interval could be minutes, hours, days, 
weeks or longer. The update interval does not need to be the 
25 same for all different keywords, as previously mentioned. 
This is used to calculate the rate of change of popularity of 
web pages and can be used as a selection criteria. 
Different Profile Type s in the Web-page/URL link Table 

The cumulative surfer trace includes information on users 
profiles so Table 8 can be calculated with subscripted values 
of a for different profile types. These values of a 13 etc 
would correspond to the profile types for the subscripted 
values of X. This allows the popularity of different groups of 
35 people to be recorded. 

New Web-page Data input to the Web-page/URL Link Table 
172 

The simplest method of having new pages recorded by the 
^^search engines is for web-page developers to submit 
information, shown as action 176 in FIG. 4, which infor- 
mation includes URL 66, key-words 70, site descriptions 68, 
target audience 72 and date-tie 74, each time they create or 
update a web-page. 

This information directly updates Tables 2 (URL table 
188 of FIG. 4) and 3 (Keyword URL link table 172 of FIG. 
4). The URL 66 and description 68 are entered in Table 2 and 
the date-time (74) at which the page is submitted (the Z 
value) is inserted in Table 3 for each of the keywords (70). 
Users are allowed a set number of keywords 70 with which 
they can submit their web page. An example of what Table 
3 would look like with just Z values is given below (format 
dd-mm-yy). 



TABLE 9 



Data Table created from submission by web developers 

Key-word Key-word Key-word Key-word Key-word Key-word 



URL address 
URL address 
URL address 
URL address 
URL address 
URL address 
URL address 



27/02/98 

28/02/98 28/02/98 



29/02/98 



27/02/98 



18/02/98 



28/02/98 



28/02/98 



18/02/98 18/02/98 
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, If there is, no date for the combination of the4^gjy^nd 
"keyword in Table 3, then- the new date is automatically 
inserted. If a date already exists in the Table, then the dates 
are compared and if the dates are too close, i,e. less^harra 
prc-detejminejcLperiod, then the old date remains and the 
new date is ignored. This stops people -from constantly 
resubmitting to get on the top of the new web page list by 
resubmitting their web pages. If the URL in Table 3 has other 
keywords with values of Z closer than the pre-determined 
^period .then the submission is also not allowed. This stops 
web-page developers from resubmitting their web pages 
with different sets of keywords. 

When users submit a URL they could target it at specific 
types of users (different profile type s Zl, Z2, Z3 etc) as per 
Table 3. For example, an URL submission specifically 
targeted at New Zealanders (e.g. Zl) will appear at the top 
of keyword new list when New Zealanders search for that 
keyword. It will remain at the top until someone else submits 
a URL for that keyword targeted at New Zealanders. URL's 
that are targeted at other audiences will not appear as new 
sites for New Zealanders or alternatively they will not 
feature as high in the new list as the ones specifically 
targeted at New Zealanders. 

The data on new web pages does not necessarily have to 
be entered by web-page developers. It could be automated 
by having a web document template that automatically 
submits data to the search engine whenever the information 
on the web-page has been significantly changed. It would 
prompt the web-page developer to change any key-words as 
appropriate. 

Another embodiment requires sending specialist crawlers 
out to find web site addresses and key-words, though this has 
many of the drawbacks of existing web-crawlers. It could 
only be effective if web designers deliberately configured 
their page with the key-words identified. Any web site 
designer/proprietor willing to do this would also presumably 
'be willing to submit any updates to the search engine to 
benefit from the instantaneous listing on the search results. 

An extension of this principle is to auto-detect if a web 
address possessed key-word information in the database and 
then automatically send an invitation to provide the infor- 
mation to enable their web-page to be found easily. The ideal 
number of key-words to be submitted with each web-page is 
preferably less than 50 and probably preferable within the 
range of about 5 and 20. This also advantageously forces 
web-site designers to find the most appropriate keywords to 
describe their site and also enable them to choose the 
audience they wish to target. 

The web-page submission process may also include web- 
page developer identification process that restricts the ability 
of people to use the system fraudulently. This may include 
a payment to prevent multiple web-page submissions. 
Populating the Profile ID Table 166 

ID table 166 of FIG. 4 is populated from the direct inputs 
from users. When users search the can choose their profile 
type 54 from a layered drop down menu, which could 
include, for example: 

Gender (Male or Female) 

Occupation (Professional, student etc) 

Age category etc 

The user selects different profile types from the options 
they are prompted if they wish to save this as their default 
profile type. This is then recorded in Table 5 (profile ID's 
table 166). The user may also select personalization options 
from a specific personalization options page rather than a 
drop down menu on the search page. 
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Populating the Personal Link Table 174 

The cumulative surfer trace is used to identify the search 
patterns of individual user based of sorting by User ID 126. 
This information is used to update the personal link table 
5 174 in the same way that the cumulative surfer trace 170 is 
used to update Table 3 (keyword URL link table 172). This 
table stores users past preferences as a form of automatic 
book marking. 

Populating the Security Table 168 

10 Each time a user enters a keyword 52 into the search 
engine it updates the security table 168 (Table 7) by making 
a link between the keyword 52 and the IP address 62 (or 
making a link between the keyword 52 and the User ID 56). 
The data in Table 7 is cleared periodically as the purpose is 

15 to stop systematic repeat searching from affecting the popu- 
larity lists (value of X in Table 3) rather than stopping 
individuals who occasionally perform the a repeat keyword 
search from affecting the popularity list. 
Determining the List of Web pages 

20 FIG. 5 shows the various data sets previously described, 
and various inputs and actions that result in a list of 
suggested web pages being provided, and will be described 
in more detail hereafter. As shown in FIG. 5, user data 
entered into the search engine can include: keyword 52, user 

25 ID 56, search type 58, IP address 62, profile types 54. How 
this data can be used to determine a list of web pages 250 as 
well and deciding which of the list of web pages to tag (step 
118 of FIGS. 3) for the purposes of creating a surfer trace is 
described hereinafter. 

30 The numbers (X, Y and Z) in Table 3, which correspond 
to keyword URL link table 172 in FIG. 5 contain all the 
information required to give the following types of searches 
58: 

Popular-list search ranked hit-list of the most popular 
35 URLs for that keyword based on the number X 

Hot off the press search ranked hit-list of newest URLs for 
the keyword based on the date/time (Z) 
/High-flyers search ranked hit-list of best emerging URLs 
'based the difference between X and Y J 

40 

Random search hit-list that is a random sample of URLs 

that have any of the numbers X, Y or Z 
Date created search this is hit-list based on the date time 
Z and the user-specified date of interest (not just the 
45 newest) 

The personal link table 174 also allows past preferences 
to be listed as search results 

Previous favorites search is a ranked hit-list base on the 
previous popularity for the individual (X from Table 6). 
50 jhis search is based only on the previous searching of 
the individual user. This allows the users to very 
quickly find site that they have previously visited. 
A number of other search options are also available. 
Conventional search is We list of search results from a 
55 normal search engine (116 FIG. 3) 

Other content only search. This is a list of other content, 

such as advisements, associated with the key-word. 
These search results can be combined in a number of 
m different ways 

Collective search ranked hit-list that is a collection of any 
of the search hit-lists described above (this is the 
default set of search results) 
Customized search ranked hit-list that can be a user 
65 defined combination of any of the above lists. 

FIG. 5 also illustrates the use of keyword tibia 164 and 
security table 168 in a decision 246 to send out tagged web 
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pages. This decision is based upon the frequency of key lated. An alternative method would be to rank the rate of 

word usage, the data in the security table and the presence change of popularity by the number of places they rose 

of a user identification. The details of the decision to send compared to last time. 

out tagged web pages is described fully in FIG. 16. The formula of calculating the rate of change of popu- 

How the Different Types of Search Lists are Implemented 5 larity for this embodiment is given by: 

More details on how each of these types of searches is ^ ^ 
implemented is provided below along with some of the 

advantage and disadvantages of each. The system relies on where X„ is the maximum value of X for the corresponding 
the brain power of the user, this time to determine what sort key-words and p is an additional variable that can be 
of search they want to do which will depend on what they 10 changed to alter the relative significance of changes at the 
want to find. The search methods are described easily so top and bottom of the popularity list, 
users should intuitively know which one to use. The reason for multiplying by the maximum value of X is 
Popular Search to ensure that small changes at the lower popularity levels do 
FIG. 6 illustrates the process for determining a list of not swamp more significant changes higher up the table. For 
popular web pages associated with the entry of a keyword 15 example, a web site having previously recorded only one 
270 in step 272. If this search is selected and a keyword is- selection and then attracting 5 hits the next day would 
entered, step 274 follows and produces a list of web pages^ exhibit percentage increase of 500% whilst another web- 
based oh the values of X taken from Table 3 (172, FIG. 5) ^ page may have experienced an increase from 520 hits to 
for the keyword 270 entered. These web pages are identified ' 4000 hits (a much more significant increase) though this 
by^a unique web-page(URL) number from Table 3. 20 would otherwise appear as a lower percentage increase. 
Thereafter, in step 276 the list of web-page numbers found } Random Search 

from step 274 is combined with the URL address and This is a random selection of less-popular web-pages for 

web-page description from Table 2 (188 FIG. 5). In step 278 the user that want to look at web-pages off the beaten track, 

the resulting list of web pages is then tagged, depending on ; based upon a random selection of web pages that has any 

the results of step 246 in FIG. 5 as described previously, and 25 value of X, Y, and Z associated with a keyword that is 

sent to the user for them to make their selections. Hot off the entered. Accordingly, after a user enters a keyword in step 

press search. 352 as indicated in FIG. 9, reference is made to the keyword 

FIG. 7 illustrates the process for determining a list of new URL link table 172 illustrated in FIG. 5, and a random list 

web pages associated with the keyword entered in step 290. of web pages numbers are generated automatically using a 

If this search is selected and a keyword is entered, step 294 30 random number generator are determined, as illustrated at 

follows and produces a list of web pages based on the values stop 354. Only web pages that have values for X, Y or Z 

of Z taken from Table 3 (keyword URL link table 172 of associated with the key word are chosen in this random 

FIG. 5) for the keyword entered in step 290. These web selection as this indicates that at some stage in the past as 

pages are identified by a unique web-page (URL) number used or web page developer thought the web page had some 

from Table 3. Thereafter, in step 296 the list of web-page 35 connection to the keyword. Thereafter, in step 356 the list of 

numbers found from step 294 is combined with the URL web-page numbers found from step 354 is combined with 

address and web-page description from Table 2 (URL table the URL address and web-page description from Table 2 

188 of FIG. 5). In step 298 the resulting list of web pages is (188 FIG. 5). In step 358 the resulting list of web pages is 

then tagged depending on the results of step 246 in FIG. 5 then tagged, depending on the results of step 246 in FIG. 5 

as described previously, and sent to the user for them to 40 as described previously, and sent to the user for them to 

make their selections. make their selections. 

The user will also be able to see exactly when each Conventional Search 

web-page was submitted so Internet users can be aware of its This is the normal search method of a conventional search 

currency. An indirect consequence of this feature is the engine, referenced as other search engine 116 in FIG. 3, 

incentive for web designers to update their sites. The promi- 45 which may or may not be included along with the searches 

nence given to new and updated sites provides a means of according to the present invention, at the option of the user, 

becoming established on the popular hit-list and encourages as noted previously, 

the use of appropriate key-words and rewards the up keeping Content* Only Search 

of web pages that users find useful. k Triisis a list of content, such as advertisements, associated 
High-flyers Search 50 /wit the key-word,, which the user cannot contfoly^^^ncs 
FIG. 8 illustrates a high-flying web pages search associ- ^that have paidlfie most will be at the top of the list, as 
ated with the keyword entered in step 320. This is a list of / described further hereinafter, in accordance with the p re- 
web pages that ate increasing in popularity fastest. If this ferred embodiment of the invention. Of course, other sys- 
search is selected and a keyword is entered, step 324 follows terns for identifying the order of paying content providers 
and produces a list of web pages based on the relationship 55 can also me implemented, 
between the values X and Y taken from Table 3 (172, FIG. Previous Favorites Search 

5) for the keyword 320 entered. These web pages are FIG. 10 illustrates a previous past favorites search, that is 

identified by a unique web-page (URL) number from Table based only on the previous searching of the individual user. 

3. Thereafter, in step 326 the list of web-page numbers found This allows the users to very quickly find sites that they have 

from step 324 is combined with the URL address and 60 previously visited and performs, therefore, automatic book 

web-page description from Table 2 (188 FIG. 5). In step 328 marking. It should be noted that since a password is pref- 

the resulting list of web pages is then tagged depending on erably used to logon to the search engine system according 

the results of step 246 in FIG. 5 and sent to the user for them to the present invention, the user will be able to access their 

to make their selection. personal preferences from any computer. 

The high-flyer list is calculated by comparing the old 65 Thus, when the user types in a keyword at step 372 as 

popular ranking (Y) and the new popular ranking (X) from indicated in FIG. 10, step 374 follows during which it is 

Table 3. From this the percentage increase in hits is calcu- determined what are the favorite sites (based on previous 
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usage) for that keyword from the personal link table 174 bers found from step 404 is combined with the URL address 

illustrated in FIG. 5. Because the user has a password that and web-page description from Table 2 (188 FIG. 5). In step 

can be used to logon to the system the user will thus be able 438 the resulting lit of web pages is then tagged, depending 

to access their personal preferences form from any com- on the results of step 246 in FIG. 5 as described previously, 

puter. 5 and sent to the user for them to make their selections. 

Due to this search capability there is, therefore, no need Customized Search 

to manually bookmark web pages. If a user forgot to FIG. 13 illustrates a customized search that allows the 

book-mark a good site on, for example, ' marbles', they can user to decide how they want their default hit-list to appear, 

easily find it by retyping the keyword that lead them to that In step 462, the keyword and User ID is selected in order to . 

site. If a user's preferences change they will be reflected in 10 initiate the customized search. Prior to initiating the cus- 

the personal links table 174. tomized search in step 466, which step is identical to step 

Another embodiment of the personal preference search 404 of the collective search previously described with 

includes specifying the date the web page was list visited, respect to FIG. 1, however, step 464 is applied to customize 

with or without using a keyword. The web pages are then the users default mixture of hit-lists For example a user may 

ranked based on Z in personal links table 174 of FIG. 5. For 15 want their default search results to include only popular and 

example if a user looked at a site in the middle of last year new web pages but no high flying web pages. This custom 

the user can refine the search by date, thus making it easier search is then performed in step 466 to generate a list of web 

to find a previously useful web-pages more easily, even if page numbers. Thereafter, in step 468 the list of web-page 

they could not remember the relevant keyword numbers found from step 466 is combined with the URL 

This automatic book-marking feature can also act as a 20 address and web-page description from Table 2 (188 FIG. 5). 

device for monitoring the type of Internet use being under- In step 470 the resulting list of web pages is then tagged, 

taken by a particular computer and thus for example, can depending on the results of step 246 in FIG. 5 as described 

provide warning to parents/employers of children/ previously, and sent to the user for them to make their 

employees accessing undesirable sites, such as adult web- selections, one preferred embodiment, the make-up of the 

pages. In a preferred embodiment, for parents/employers 25 default search results list can be amended by 'learning' from 

unlikely to use the computer themselves, notification of such the use's behavior to create a changing customized search 

usage is automatically provided by letter to the parent/ based on the user's own search patterns. If a user consis- 

employer that lists the keywords selected and web pages tently chooses new web pages or high-flying web pages for 

visited by the children/employees. This information is found example, ten their set of default search results will be 

directly from each user table 174 of FIG. 5.This requires a 30 changed to reflect their normal search style, 

user identification code that also included parental/employee Magazine Search 

information. - The magazine search according to the present invention 

Collective SeaTch enables users to search by following a series of menu-driven 

*lSe collective search, as illustrated in FIG. 11, is the subject choices (or similar hierarchical structure), rather 

default search according to the present invention and is used 35 than entering a specific key-word(s). 

when the user does not actively choose on of the other search Existing magazine-style search engines require editors to 

options. set tne structure of information, decide on its relevant merits 

^ypon entry of a keyword in step 402, that keyword is used and set the criteria, such as price, for space on a given page 

to select from a combination of web page selections asso- transmitted to the user/viewer. Using the search system of 

ciated with that keyword. A shown, for example, in step 404, 40 the present invention, the users' themselves dynamically 

an equally weighted combination of conventional, popular, decide what is and is not worth seeing. Thus, although 

highflier, new and past search results is used to obtain a list editorial input is needed regarding a hierarchy of subjects, 

of web page numbers. Thereafter, in step 406 the list of the web-pages that emerge as the most popular for each of 

web-page numbers found from step 404 is combined with these subjects will evolve automatically, 

the URL address and web-page description from Table 2 45 Use of Data Sets for Different Groups of People 

(188 FIG. 5). In step 408 the resulting list of web pages is Different popular hit-lists may be employed to provide 

then tagged, depending on the results of step 246 in FIG. 5 results which would reflect different cultural, geographical, 

as described previously, and sent to the user for them to professional, gender or age interests. Thus, as shown in FIG. 

make their selections, the system is first configured, the 14, when a user enters a keyword and User ID in step 490, 

search engine 10 database will not posses any information 50 the default profile of the user can be used to reflect the type 

on popular, high flyers and new web page hit-lists, so search of web pages that people of the same "group" as the user 

results will initially be obtained from the conventional profiles desire to see. Thus, the search that takes place in step 

hit-list (normal search engine), and the tagged web pages 494 is based on the subscripted X, Y and Z values obtained 

then used to create the database sets as have been described. from the default profile of people of those "group" affilia- 

As the system develops, the data sets associated with each 55 tions identified in the user's personal profile obtained in step 

of the other search types will become populated, and 492. Thus, the rather than an overall global search result, 

searches using the other search types will become more search results are obtained particularized for the group that 

useful the user identifies with. The resulting list of web pages, 

Date Created Search derived from steps 496 and 498, as have been previously 

FIG. 12 illustrates a date created search at allows the user 60 described, are particularized for that group, 

to select the date that the web-page was submitted. This Thus, for a particular user with the profile type New 

feature will only work for web-pages that contain a date Zealand selected as a geographical factor, a search for team 

created data entry, identified as date-time submission 74 in field sports and related key-words, rugby material might 

FIG. 4. Upon entry of a date-time and/or a keyword in step figure prominently, whereas an American profile type may 
432, the search engine 10 will perform step 434 in which a 65 produce a bias towards baseball/American football material, 
list of web page numbers associated with these variables is for example. This technique offers the ability to discriminate 
obtained Thereafter, in step 436 the list of web-page num- between the different meanings of the same words, accord- 
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ing to Ihe context of the popular hit-list associated with a determining which sets of results get sent out with "tagged" 

particular profile type. A general search using a key-word web pages. Reference with respect to this was already 

'accommodation' for example would include results related mentioned with respect to authenticating user's of a particu- 

to housing, renting and similar whereas if the user indicated ^r group, doctors in the example provided, 

an interest in optometry in their profile type, then the term 5 As shown in FIG. 16, after entry of keywords and other 

'accommodation' would be interpreted quite differently. data in step 554, there are three decisions that determined 

Hie relevance of such sites will evolve automatically, whether results are actually ^tagged" as has been previously 

without any active evaluation of the sites by the search described in step 118 of : FIG. 3. 

engine operator or the user. There are no complex olgorithm u As shown by step 556, for a user that has a user ID and 
required to analyze me relevant of web-sites for particular 10 ^ ho f n 10 « ^ f ^sonal hnks table 174 of FIG. 5 
t^esofusers.teteao;thetypeofsited^medreievantwill (™* 6) as previously described it is necessaryto tag all 
be decided by those users selecting those characteristics for ° f ^ir results so that aU of their past preferences are 
their profile type , i.e. American females interested in rccorded ™ ^eir P ersonal Lnks table 74 * ^ ' en ^ e 
rock^hmbing. Sites of greater relevance will naturally according to the present invention system can update the 
attract more hits, increasing their ranking and thus increas- is «^ t*™ a ? preferences but not update Table 3 if certain 
ing the chance of a subsequent user also investigating the security ^vels have not been sahsfied (see below). If, 
site. In the above example, any web sites listed for the however, the personal link table 174 is stored on an inch- 
keyword 'accomodation' which were unrelated to "dual's TOin P uter rath ^ r man at locaUon there ^ no 
optometry, sight, lens, vision, etc., would not be accessed for nee A d * ° ut lagged results as the data is stored locally, 
the period of time required to make a valid it. It would 20 /t show ° ^ St ?£ ^^'^T^* *f ^ ! 
therefore receive aver? low ranking and hence be even less » ™de *at the IP address ; 62 \hanV^w*M 
likely to be accessed by Her users. * e usmg security table 168 (Table 7) before tie 

The user can select different profile types for different ™" 15 * of tagged results If so, the user can still 

searehesduringasmglesessionandisnotberestrictedtom^ ^dertake the search though it will not contribute to the 

default profile types 25 cumulaUve surfer trace 170 (Table 4). Triis allows all normal 

In a further embodiment of the invention, there can be «i» to affect the popular hit-list and all users to search 

included a level of authentication for person's of a certain wh f ver would like, but prevents ^fraudulent ^users, 

group to have their search results ally be used for purpose of ™* ™ spammers from contributing to the popular hit-list, 

updating the database relating to that group. For example, ^security table 1^ can also include formation on links 

doctors who have a user ID that identifies them as doctors 30 between keywords 52 and a user ID 56 to detect repeat 

may perform a search related to a certain medical condition, ^^ a .ipg ; , ^ c 

and their selections can be tagged and used in the database While it is possible for user s to change the IP address of 

for that group of doctors as has been previously described. meu computer this is also detectable and prevenUbie by a 

However, although patient's may desire to identify their number of methods such registering and tracking the use of 

profile with that of the same group of doctors, their selec- 35 IP numbers. 

dons are not as significant as those of the actual doctors, and ° tber methotk to exclud e false searches include: 

thus while they are able to view the web page listings that ° nl Y creating a surfer trace for users with a user ID 554 

doctors deem most pertinent, their selections are not used to recorded with the search engine. 

update the doctor's group database, since their IDs do not Extending the time limit requited to make a visit count as 

identify them as a doctor 40 a useful hit. 

Limiting Search Options Do not count single visits to a URL from a keyword (for 

Another feature of the present invention is keyword which there is no means of measuring a lapsed- time), 

eliminator feature, which is illustrated in FIG. 15, and As shown by step 560, popular keywords can be traced 

prevents certain users such as children, from searching for once every tenth, hundredth, or even thousandth occurrence, 

undesirable keywords and web-pages when the keyword 45 and the frequency of this selection can be changed to 

eliminator feature is turned on. The present inventor's have optimize the system. The frequency of keyword usage is 

realized that it is potentially much easier for example, to stop determined from keyword table 164 as shown in FIG. 5 

children searching for pornography, rather than attempting (Table 1). The frequency of sending out tagged results can 

to trace and prevent access to all sites on the Internet with also be linked to the rate at which popularity is changing for 

pornographic content. This would be used as a complimen- 50 different key words. For example the keyword "IBM" would 

tary tool to existing "net nanny" type devices. Thus, as probably have IBM's home page at the top and most user's 

shown in FIG. 13, with the keyword eliminator turned on, a would gp there, whereas the key word "latest fads" may 

preexisting table inaccessible keywords is stored in a table have a constantly changes set of web pages that needs to be 

and compared in step 522 with a keyword previously sampled more frequently. 

entered, as shown by step 520. Thus, keywords that are 55 To avoid the keyword URL link table 172 of FIG. 5 (Table 

inaccessible will not be searched. Thus, for example, parents 3) from becoming unduly large, one method is to only 

could choose the types of keywords 552 that they do not register keywords in Table 3 once they reach a certain 

want their children to search for — and this will be different frequency of usage. This is controlled by not sending out 

for different sets of parents. The system filters out the tagged rests for less frequently used keywords (found from 

keywords that may be used for subsequent searching in step 60 Table 1). 

524. Active Suggestion of Web Pages to Visit 

Determining Which Users to Sample Another feature of the present is illustrated by FIG. 17, 

FIG. 16 illustrates the process of determining which and involves using data to actively suggest web pages. This 

search results should be sampled and used to make up the is different from a search because the user sets up the request 

cumulative surfer trace table 170 of FIG. 4, also referred to 65 and is informed if there is any new data on the subject. To 

as Table 4. While possible, it is not necessary to collect data do this the users has to actively specify which keywords they 

concerning every single search, and this can be controlled by are interested in and the profile type that they would like to 



US 6,421,675 Bl 
27 28 

act as a filter or agent and the search type (new, highyflying, tried using other search methods, identified in step 646. Each 
popular) in step 588. This information is stored in the user's keyword is then tagged in step 660, and those that are 
profile ID 166 shown in FIG. 5 (Table 5). selected by a user in step 662 are used to form a keyword 

Thus, at various interval's the user receives a list of surfer trace 648 as shown in FIG. 19, which contains the 
suggested web pages determined by a group of like minded 5 original keyword 52 that the user entered, the keyword 
humans. For example a user may choose to be notified of selected 652, and the IP address 130, user ID 128 and 
web pages with the following. date-time 132 data as in the previously described web page 

surfer trace. 

The data from the cumulative keyword surfer trace 648 is 

~" 10 then used to reinforce links between keywords. In this way 

Keyword 582 profile type (agent 588) Search type 586 ± learns which keywords are associated with each 

Rugby New Zealand, Male highflying . 3 _ _ t . t « i * j * «_ 

Decay treatments Dentist new other. The system learns which words are related to each 

— other in the same way that the system learns which URL's 

.„ t •_ „ are associated with the key-words. The lists of suggested 

This way if there « ^3W^ WOJin*r k ^ will become more relevant over time as the 
that other New Zealand males found useful (i.e. they spent * f ^ 

a significant amount of tune looking at the information — a 

high rate of change of X in Table 3) the user would be u f ed *. ^ 0 4 . ^ . ♦uc ♦ , ^ , 

notified. Similarly if there was any new information on Creating Data Sets that Determine the Suggested Keywords 
"decay treatments" submitted for dentists to look at, the user As shown in FIG. 20, a keyword link tab e 696 and a 
would be identified about it (value of Z in Table 3). It is 20 cumulative keyword trace table 698 are used along with the 
unlikely that a computer agent will ever be as good at previously described security table 168 to create the data 
filtering information as a selected group of peers. An advan- sets for suggested keywords. The key-word link table 696, 
tage of this system compare to other "agent type" software shown in Table 10 below, records how often each key-word 
is that this does not require any software on the user's is selected from the suggested key-word list This can then be 
computer. It is all included as a natural extension to the other 25 ™*<* *> rank the of the usefulness of different key-words 
search engine data sets. rclative to each other - 

The suggested web-sites can be displayed for the user 
when they next access the search engine or they may choose 
to be notified of these suggested web pages via e-mail 
notification. This way web pages can be drawn to the user's 30 
attention without any active searching for these keywords. 
Passive Suggestion of Web Pages to Visit 

Another feature of the present is illustrated by FIG. 18, 
and involves automatic web-page suggestion based on how 
the user has searched in the past and requires no active input 35 
from the user. 

As shown, in step 620, upon the entry of a user ID, the 
system can be activated passively, at various intervals or 
times (such as at each login to the search engine), by looking 
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Kevword link Table 




Key-word 
1 


Key-word 

2 


Key-word Key-word Key-word 
3 4 5 


Key-word 1 
Key-word 2 
Key-word 3 
Key-word 4 
Key-word 5 
Key-word 6 
Key-word 7 


20 


5 

356 
20 


1134 
3 

168 



at which so keyword*, profile ypes and search types .the 40 ^ from ^ ^ w ^ le who eD , ered 

USmg * e t^Jlh™ key-word 2 found key-word 3 the most useful followed by 

of FIG 5 Table ^ For example it may ******** 5 ^ fc ^ j ^ k ^ can have a 

frequently looks at Rugby information as a New Zealand for , k ^ 3 was fouDd ^ M 

male" and looks at decay treataents as a dent*t» Ttos 1134 ttoM ^ lrying key ^ otd Z However keyword 2 was 

jnformation can be found from the automatic b^k markmg 45 found ^ ^| ^ ^ ^ ^ oid 3 

Uble previously referred to personal links table , 174k I ^the Information blinks between keywords in Table 10 

user has not looked at these subjects for a certain length of d Molmiaott about how p^fe ^ ^ 

time and there are new or h^ng information sources, ^, rds(k eywordsurfe r traces 648). The cumu- 

the user wUl be automatically notified of these new infor- ^ ^ ^ m ^ ^ rombm J mformation 

mahon sources. -. from all individual keyword surfer traces 648 and it is used 

In a modification of this embodiment, a periodic e-mail bow m « mts „ ^ each 

can be sent out with the two newest and highest flying sites rf ^ for ^ ^ rf 

£ lat !lf2.i£l^ 0 r* ot ^^ . The information collected from each individual surfer 

^^^^M^m^^W^ . u-aceisaseesofinputsbecomeacumulativekeywordsurfer 

A problem with Internet searching for Many users is 55 . r c , . . TUa ^ 

. ^ v . . . . , 4 r & . . tU trace, shown ui table form below in Table 11. 

knowing which key-word to use for searching. While the__ ' 

present invention could be implemented with au infinite 
number of keywords, too many key-words (includes 

phrases) that users choose can be problematic. Keyword cumulative surfer trace 

Accordingly, as shown in FIG. 19, the present invention 60 — — 

also provides for a data set 642 that provides synonyms for ' T u _ fT t „ r*™^ * rw H - 

, / , , . i ci . IP Number User ID (original) (suggested) Date-tune 

the keywords entered along with the particular profile type 

in step 640: The system represented in FIG. 19 is referred to 

as a key word suggester. This is implemented, in -one FIG. 20 also illustrates how links between keywords in 
embodiment, by matching the key-word entered by the user 65 Table 10 can be initiated by recording sequences of key- 
in step 640 with the existing keywords and phrases in words that users put into the search engine. If for example 
keyword table 164 of FIG. 5 (Table 1) that other users have someone searches using the keyword "NHL" and then 
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"National Hockey League", this would then draw an asso- 
ciation between these two key-words in Table 10 by record- 
ing this as one hit. Again this captures the reasoning power 
of users to define the link between two keywords. Often the 
keyword in sequence will be totally unrelated to the previous 5 
key-word but sometimes it will be relevant If the next user 
chooses it from the key word selector it will reinforce the 
key-word link in the same way that repeat selection to web 
pages reinforces links between a keyword and a URL. 

n^fpllo.wing is an example of keywords that may be 10 
suggested after entering?the*a v simple key-word like "Book'* 

book sales 

book reviews 

specialist books 15 
second hand books 
a used books 
special edition books 

All of these key-words (phrases) would come from infor- 
mation seekers (users) and information providers (web-page 20 
developers). The most appropriate keywords will emerge 
naturally over time. 

All keywords used by users are entered into the key-word 
link table 696 of FIG. 20. Thus, if people enter an uncom- 
mon keyword such as "cassetes" instead of "cassettes" the 25 
key-word suggester will suggest that tithe user tries "cas- 
settes". There is therefore, no need to create a set of 
URL-keyword links in Table 3 for "cassetes" Thus saving on 
data space and here is also no need to send a tagged set of 
results for the keyword "cassetes". Hence there will be less 30 
data sent back to the search engine. 

It is also a contemplated embodiment to run the keyword 
suggester like Table 3 and have high flying keyword asso- 
ciations and new keyword associations so the system can 
leam how keyword associations change over time. For 35 
example, the keyword suggester trace wan store the most 
recent keyword links and modify the main key-word the by 
a history factor, in tie same way as Table 3 is modified by the 
cumulative surfer trace. 

The cumulative keyword surfer trace 698 is processed in 40 
the same way as the cumulative web-page surfer trace 170 
of FIG. 5 to reinforce links between keywords in the 
keyword link table 696 (Table 10). A time variable can also 
be included so that if a user chooses another keyword very 
quickly it is assumed that the previous keyword was rot 45 
useful and is not counted as a keyword surfer trace. 

Also, the individual keyword suggester can store, for each 
user, their personal keyword links. Further, the keyword 
suggester can be based on a number of different profile 
types. The word associations may be quite different for 50 
people of different culture, nationality, occupation and age 
etc. Different keyword suggesters wan capture the keyword 
association of different groups of people. The keyword hits 
in Table 10 can be subscripted in the same way that the 
values of X, Y and Z are subscripted for different types of 55 
profiles in Table 3, as explained previously. Using the Tables 
to create a list of suggested keywords 

FIG. 21 illustrates a variety of manners in which a list of ^ 
suggested keywords can be created. 

One manner is by ranking the values of X in the keyword 60 
link table 696 (Table 10). This ranked list of keywords is 
combined with keywords from a normal search of keywords, 
described previously with respect to step 646 of FIG. 19. 

Another manner of suggesting keywords, shown as step 
730, is to compare the popular list (URLs X values) Or the 65 
user-entered key-word with the popular-list of other key- 
words in Table 3. A similarity pattern X values in Table 3 
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indicates that these keywords are similar. For example a user 
may search for "film reviews" and the keyword suggester 
may come up with "movie reviews" which has a more 
comprehensively searched list of sites. In this case there is 
no physical similarity between the words movie and film, 
but they are linked by the similarity of the patterns of URLs 
links they have in common in Table 3. 

The usefulness of the key word suggester list is enhanced 
indicated by step 744, by associating with each key-word on 
the suggestion list an indication of whether there are any of 
the aforementioned searches available (popular, high flyer, 
etc-) for that key-word in keyword URL links table 172 of 
FIG. 5 (Table 3). The keywords with the most search results 
are then highlighted. 

Decision to Send Out Tagged Keyword Suggestions List 

The security table 168 and keyword link table 696 are 
used to determine which keyword links to sample in a 
manner similar to that previously described with respect to 
tagg web pages<As with the decision for tagging web pages 
this can depend on whether it is a repeat keyword (fouhd x 
from security table 168) and on the frequency of keyword 
usage (found from keyword table 164), as well as the 
considerations previously discussed. 
Determining Other Content 

When searching on the Internet, various different web 
pages listings and web pages are displayed as has been 
described. One common chacterisics of each these different 
web page listings that have been described is that when they 
are displayed they appear substantially identical to one 
another. As shown in FIG. 25, each of the different listings 
900, though the text may be different, is otherwise visually 
identical. Other listings 902, however, are many times larger 
than the listings 900, may include graphical content, and 
appear more prominent when displayed to the user. Such 
listings can contain the same content as a web page listing, 
or other content, such as advertisements, pictures, editorials 
and the like. 

This* other content maybe displayed to a particular user 
based upon key-words; user profile type (nationality, age, 
gender, occupation, and so forth) and the time of the day, for 
example. 

In many instances, this content that is displayed along 
with web page listings is inserted into the display area using 
mechanisms that are different from the searching system 
described previously with respect to conventional search 
engines. The mechanism by which this content is displayed 
in large measure based upon some other criteria, such as 
payment for the space that is used. While the system for 
selecting this content works, it is difficult to keep track of 
which content was displayed when, especially if that content 
is frequently changed. Thus, another aspect of the present 
invention, which will now be discussed is a system for 
tracking changing content, and allowing for content provid- 
ers to dynamically select when their content will be dis- 
played. 

This dynamic selectable content as illustrated in FIG. 22, 
-maybe displayed to the viewer based upon,, keyword .or 
profile type as entered by the viewer in step 762 as shown. 
Within the content selector step 764 that then follows, the 
time of the day is considered and used in selecting the 
appropriate content 902 as illustrated in FIG. 25 along with 
the web page listings 900. Each content 902 transmitted with 
the search results made up of web page listings 900 is tagged 
in step 766. Thus, if a user in step 768 selects that content 
902, the results of that selection is fed back to the content 
selector 764 so that the content database associated 
therewith, can be updated as surfer trace data in a manner 
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such as has been previously described. Thereafter, in step 
770, that content 902 is displayed, typically simultaneously 
with content 900 

In addition to the surfer trace data being input as has been 
previously described, this content embodiment also provides 
for the web page developer, or content provider, to deter- 
mine the frequency with which this content will be 
reviewed, and, depending upon the patterns of users with 
respect to web page listings that are viewed alter the manner 
in which the content provider's content 902 is displayed 
based upon key words, user profile and the like. In order to 
implement this dynamic content flexibility, there are three 
additional data tables, illustrated in FIG. 23, which are used 
to track the changing content 902. These tables are keyword 
content data table 804, personal profile content data table 
806; and content provider data table 812. 

Keyword content data table 804 is illustrated in more 
detail in Table 12 below, and its characteristics are: 

H is the cumulative number or hits for one time period for 
the keyword. This is the number of times people choose 
that keyword; 

N is the number of times particular content 900 that is 
associated with a keyword has been sent out for display. 
This is not necessarily the same as H since content 
associated with a profile type may be have a different 
selection factor than content associated with the key- 
word. Tis selection factor can be various variables, such 
as votes or price; - - 

.^isktto from each 

^ content provider(e:g/a/sd 

to*be^associat'ed" with "that keyword); 

T is the total of the selection factors for each keyword and 
is the sum of A's; and 
^ P is*thexcontent r value, as.determined by votes or price, for 
each keyword and is T/N (e.g. this could be the $ per 
time content is sent out with that key word — this is a 

c price of being associated with that key word) 

TABLE 12 



Keyword content data sets 



Keyword 



Cumulative 
hits for one 
month (H) 



Amount of 

Content Content 

sent out Provider 1 
(N) (Al) 



Content 
Provider 2 
(A2) 



Total 
00 



Books 
Fish 
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In addition to the key-word dataset 804 it is possible to 
have a data set of the following type for different profile 
types 806. It contains the same entries for each profile type, 
instead of keyword as described above with respect to the 
keyword content data table 804 of FIG. 23. 



TABLE 13 



10 



Profile type 



Personal profile content Table 

cumula- Amount of Content Content 

rive Content Provi- Provi- 

hits for sent der der 

one out 1 2 

month (H) (N) (Al) (A2) 



Total 
CT) 



(P) 



15 



Male 
Female 

Professional etc 
Undefined profile 



20 



Table 13 determines the content value of the content 902 
to specific audiences of people as opposed to different 
keywords and allows for targeting of specific audiences. 

It is within the scope of the present invention to include 
combination profile types in Table 13 as well, such as male, 
professional or New Zealand, females. The content value for 
25 the combined profiles will be different am the content value 
of individual profiles. The mechanics involved in determin- 
ing the content value and choosing the content 902 will be 
the same, and described further hereinafter. 

Content provider data table 812 of FIG. 23 is illustrated 
30 in more detail below as Table 14 and contains information 
about the content provider, such as name, address, 
advertiser, content information such as the Bitnap (HTML or 
Java applet or similar) that the content 902 will use and a 
unique number to identify each different item of content 902. 
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TABLE 14 



Name 



Address etc 



Unique number 
Content Information for each Content 



40 E.g. John 



Content no. 
Content no. 



(P) 
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This Table can also include the maximum content value M 
that the content provider is prepared to give. There is no 50 
limit to the number of content providers that may attempt to 
have content 902 displayed with a web page listing that is 
associated with a particular keyword. 

It is possible to have a separate Table 12 for each country 
or area, so that the content value per country or area, per 55 
keyword could be different. In addition there could be 
different content values for different time periods in each 
country or area. 

It is possible that provider's of content 902 could target 
both the key-word and the audience by identify each of the 60 
keywords with target audiences, e.g. the number of hits 
associated with the word rugby could be broken down into 
the different profile type s that search for the word rugby. 
The cumulative number of searches for rugby could be 6000 
split into 520 under 21 's and 4000 21-50 year olds and 520 65 
50=age group. Thus, there may be a different content value 
for each of these sub classes within a keyword search. 



This Table may also store details of the content provider, 
such as passwords, payment details (e.g. credit card number 
and authorization), content delivery (number of times con- 
tent has been sent to users) etc. The data sets for the above 
mentioned content tables are populated as follows. For the 
keyword content data table 804 

H, the cumulative number of hits for a particular key word 
for one time period, is taken directly from Table 1 
(800). 

N is the number of times content is sent out associated 
with the keyword This is incremented each time all 
item of content 902 is displayed to a user that is 
specifically associated with that keyword 810. 

The values for A 802 are selected by content providers for 
each keyword. The content provider can also enter a 
maximum value M over which they will no longer 
select to be sent out with the keyword. 

T is the total for each keyword and is the sum of As 

P is the content value, as determined by votes or price, for 
each keyword and is T/N 
Populating the Personal Profile Content Data 

H is the cumulative number of hits for each profile type 
and this information is taken directly from Table 1 (sum 
of the indexed W's). 





Cumula- 
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month (H) 
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(Al) 


(A2) 
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Book 


134 


134 
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10 


0.050 


Fish 


52 


80 
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10 


0.52 
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N is the number of items of content 902 sent out associ- of content 902 is displayed would be incremented only for 

ated with the personal profile. This is incremented each the 'male under 21' Table (this is the figure used to deter- 

timc an item of content 902 is sent out that is specifi- mine the value of the content per unit view, 

cally associated with that profile type 810 Determining Which Specific Content is Transmitted 

The values for A 808 are placed, through an entry process 5 ^ example below shows how content associated with 

akin to bidding, for each profile type. The content the keyword is selected. It is the same process for content 

provider can also eater a maximum M they are prepared associated with profile types, 
to pay, or vote, as the case may be. 
T is the total for each profile type, and is the sum of As. 
P is the content value for each profile and is T/N 10 
Populating the Content Provider's Details Table 

The majority of the content provider's deals 812 are 
electronically entered by the content providers. Bach time a 
content provider's content 902 is sent out this event is also 
recorded in the content provider's details Table 8 12!. This is 
will also record the number of click-throughs (820, 822, 824, 
826, 828) and the cost, in terms of payment or votes, of the 
content 902. This will form the basis of the electronic bill or p or t he key-word "took" the content 902 of content 
tabulation that is thereafter forwarded to the content pro- provider 2 would be displayed whenever the keyword was 
vider. 20 searched, as they are the only content provider associated 
How the Data Sets are Used to Select Content Sent Out to wit that key-word. However, for the key-word "fish", con- 
Users tent providers 1 and 2 would have their content sent out the 
In the discussion that follows, with reference to FIG. 24, same number of times. In the system scaled to the levels at 
it is assumed that only one banner of content 902 is which it is intended to be used, them will be a very large 
transmitted with each set of web page search results 900. 25 number of ixmtent ;providers*bidding for different keywords 
The same olgorithm apply if there are multiple sets of and profile types, 
content transmitted with each set of web page results. , Calculating the Value of Content 

A keyword and profile type are submitted to the search If there is a new content provider who, for the keyword 

engine in step 852. From keyword content data table 804, "book," values the content at, for instance, $5 per month, 

personal profile content data table 806, the value of content 30 This will change the value to 0.075 and this will mean that 

902 for each is found from the value of P in the Tables. The the total associated with the word book is $15. Therefore, 

highest value of P for the keyword or profile type, deter- content provider 2 would now get transmitted 66% of the 

mined in step 8624 determines the type of content (keyword time (10/15) and the new content provider would be dis- 

or profile type) that is transmitted along with the web page played 33% of the time. The proportion of time an content 

listings 900. It may be that there is no specific value for the 35 provider's content is transmitted is A/T. 

keyword and the user may not be using a specific profile How Content Provider's Use the Data Tables 

type. In this case the values for unassigned content items When biddingfor content 902, content providers select a 

will be used (from Table 13 for users without a profile). keyword or profile to target their content from Tables 12 & 

Choosing which specific content item 902 is sent out is 13. The search engine indicates automatically the number of 

discussed below, The details for the content item (their 40 times this search has been performed for the previ6us ; time^ 

graphics, text, associated programs, etc) are taken from period (H), the number of times items of content were senp 

Table 14, content provider details table 814 and transmitted put associated with that selection (N) and the value of the 

to the user in step 868. Details of the content items 902 content P. 

transmitted for each content provider are also sent to the The new content provider then enters the selection factor - 

content provider, as shown by step 870, at regular intervals. 45 ^Aand the system can then instantly calculate the new value J 

Determining Whether it is Keyword or Profile Content That (P) based on the new total bids (T), The advertiser can also 

is Transmitted ^be told the number of views per month they are likely to get 

The type of content 902 transmitted is dependent upon for their bid (N*(A/T)). These changes are calculated in 

whether it is a key word based content or profile option real-time to give the new content provider an indication of 
based content. For example a Male from the US may search 50 how their bid will influence the value and the views they will - 

for fish. The value applicable to this search is, keyword=fish, receive for their bid. If a value and number of views are 

profile=male, profile-US, profile US, male. When deciding agreeable to the advertiser they can choose to submit it as a 

which content gets displayed the system compares the value bid for the denned period, such as a day, week, or month, for 

of the content for all the possibilities (keyword combinations instance. The details of other content providers are, 
of profile types) and sends out the content that has the most 55 preferably, not made public. Content providers may also 

value as determined in step 862. For example an under 21 enter a maximum value M they can pant with for their 

male may search using the key-word "Rugby" and the value content. This provides content providers with some security 

for the associated content for Rugby is 0.1 per view, whereas against paying too much if the value changes. If the value 

the value per view for targeting an under 21 male is 0.2 and goes too high then a content provider's bid can drop off the 
thus the content targeted at the male under 21 would be 60 list (if P is greater than M then A is not counted as a bid for 

displayed rather the rugby content. It is important to note that particular content provider). The bid would go back on 

that the cumulative frequency of times that content items the list if the value went down again, thus acting as a 

902 are transmitted (N) will be different to the total cumu- stabilizing mechanism. The content provider can, in a pre- 

lative frequency for the targeted area (H). In this example the ferred embodiment, be notified by e-mail if their content 902 
cumulative frequency (H) of the number of times 'rugby' is 65 has dropped off the list due to their value limit M . 

searched for and 'males under 21' would both incremented As shown by the content provider details table 812 of 

by one (via Table 1). However, the number of times an item FIG. 24, for instance, content providers thus have an account 
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with the search engine proprietors and procedures for deb- keyword. The default setting is that a hit on content counts 

iting their account for their content is automatically calcu- the same as a hit from the list of web pages. The value of 

latcd from the account details on a periodic basis. An content hits can be set higher or lower than unity, depending 

electronic statement of the number of views, cost per view, Q n the price of the content, e.g. the "content hit factor" may 

number of click-throughs and cost per click-through for each 5 nee d to be increased for valuable keywords as this would 

content provider is also forwarded to each content provider, decrease the ability to spam these commercially valuable 

since this information is also stored in content provider keywords. The higher the content factor, the higher the 

details table 812 (Table 14), In a preferred embodiment, it is resistance to spam as the search results would be more 

possible to identify clusters of similar keywords based on dependent on price rather than popularity, 

^thekeywordlir^ m time period for content bidding 

keywords is so that content 9«2 can be targeted at groupsof / Content rovideis bid a ocrtain for a particular 

words rather than just mdividual words. The cluster for the/ ^ ^ Qne m(mth ^ ^ period may ^ 

key-word "car" may include hundreds or thousands of words d . Qn ^ rate . of ^. h of me price . If the 

that have links to the word car (e.g^ convertibles fa c r idl or is stablCj tne time 

automobiles, vans). Statistical clustering techniques are used ^ * ^ ^tensd or lengthened correspond- 

.to define the size and frequency of key 7 word clusters. 1ms . ' 

makes it a much more automatic process the an editor _ ' , , , « . . 

Riding on clusters of keywords for content provider's to SZTU the 

^ l!e same system can be used to set values for keyword 9i) ***** leams new web W submissions, 

clusters. While grouping words in this way would incur an 20 ^h of time between accepting new-web-page submis- 

increased administration cost, it is nevertheless computa- sions p 

tionally similar and only initiated once a certain level of hits If the date of submission for a web-page is too close to the 

on a keyword had been exceeded. existin g submission for that web-page, Uien it is not 

Content only search Users can also purposely choose to ^ accepted. Tnis length of time can be changed depending on 

search only the content provider associated with a keyword. ^ of me above k ctofS 

In tins case the search results will be based on the values of Number of searches per day, per person (IP address or 

A in Table 12. Thefecontent providers that pay the.most will user ID) that count as valid hits 

be at the top of the list. This number can be changed to reduce the possibility of 

/ The key-word suggester can also help content providers ^ spamming 

choose key-words or sets of key-words that they would like Length of time before renewing the security Table 

So display. The security Table that restricts abuse, notes the links 

Controlling the search engine system There are a number between keywords and IP addresses of user identifications, 

of parameters that can change the way in which the search The length of time between refreshing this Table can be 

engine according to the present invention ranks web pages. 35 changed to make it harder to spam the system. 

These factors (described in detail below) are: The settings for these factors can be different for different 

History factor keywords or groups of people depending on: 

This determines the rate of decay of the existing popular Frequency with which searches are done 

lists (popular hit list) as described in the text previously This The rate of-change of frequency of searches 

is a number between 1 and 0. A high history factor will make ^ The price of the content 

it difficult to change the existing popularity lists. As an Tn e rate of change of price of content 

example if the rate of searching for a particular keyword is The precise setting of each of these factors will not be 

increasing quickly, then the history factor should be lower to known until the system begins operation * learning' about the 

enable emerging web pages to rise up the popularity list. behaviors. The optimum settings for different situa- 

Frequency of updating Table 3 from the cumulative surfer 45 tions may be determined by experimentation, 

trace Other Applications 

This is a measure of the frequency with which the Though the preferred embodiment has been described 

popularity lists are updated with information about the with reference to a software useable on a computer network 

users' activities (i.e. the surfer trace), for example, this may for searching the Internet, it will be appreciated that the 

be measured once a day or even once a month depending on 50 invention may be readily applied to any search system where 

the rate of change of popularity of particular keyword a human user chooses results from a set of initial search 

searches. results. Such a system may for example be part of an, a LAN 

Sampling Frequency or WAN or even a database on an individual PC. 

This is the frequency of sampling the information of how Examples of other possible areas of application for the 

users are searching. If it is a common keyword it is not 55 present invention are described below, 

necessary to monitor every search. It may be that only a Intranet Searches and Other Data Base Searches 

percentage of all searches need be monitored to accurately Intranet searches at present suffer from similar drawbacks 

determine web-page popularity. from Internet searches, indeed some intranets can in them- 

The composition of the default search list (mix of results selves be extremely substantial systems, in which identify- 

from the new web-page list, high-flyers and popular- 60 ing a particular information source or item can be equally 

lists etc.) problematic. Utilizing the present invention in such appli- 

The mix of web pages presented to the user as a default cations is within the intended scope of the present invention, 

can be changed if necessary to reflect the way in which Searching Other Media Forms 

search results evolve over time. The present invention is also intended to be applied 

Content * hit factor ' 65 matching a user's profile to other media sources (such as pay 

The "content hit factor" is a measure of the weighting per-view, television, videos, music and the like), thus allow- 

given to a hit on content being recorded as a hit for a ing content targeted to a particular audience. The same form 
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of search lists as described above (Popular-list, High-flyers, 
Hot-off the press etc) may be employed to direct users to 
appropriate material 
Shopping 

Trie search techniques described herein can be imple- 
mented in a consumer network to assist shoppers in select 
items from within one shop or among a large number of 
shops Instead of using a keyword-URL link Table, there 
would be used a keyword-item purchased link Table, that 
then records what items were purchased after each shopping 
request (key-word). This embodiment also records where 
the user purchased the product. Each time a shopper pur- 
chased an item this would increment the popularity of that 
item, using the same techniques described previously. 

The profile type s in this embodiment can be used to 
record the types of purchases made by different sets of 
people. One could, for example, select a profile type and see 
what are the most commonly purchased items for a range of 
users, and would provide assistance in choosing gifts for 
people who have a different profile type than yourself. 
Scientific Publications 

Searching scientific data bases (on-line papers, journals, 
etc.) with the present invention will dramatically reduce the 
time spent examining obscure, or esoteric areas only to find 
the information irrelevant e criteria for a valid hit for such 
uses would typically incorporate the extended time feature 
described above to establish the usefulness of the informa- 
tion source. The refereeing and referencing of academic/ 
scientific papers using the present invention could enhanced 
by classifying different levels or types of user, e.g. Dr, 
Professor etc. postgraduate, and so forth. This will enable 
users to see. for example, what information sources the 
eminent authorities in a particular field found of interest. It 
would also allows the authors of a paper to become aware of 
how often their publication was accessed and possibly 
further indicate where and how often the paper was used as 
a reference in subsequent papers. Users may have to for- 
mally register with different organizations to obtain levels of 
ability to referee. Users may also choose the level of 
refereeing for their searching. 
Online Help 

There is currently a substantial global requirement for 
online help and support particularly for computer/software 
applications. Such a need would be considerably assuaged 
by use of the present invention as the software developers 
obtain a direct feedback to the type and frequency of 
particular inquiries, whilst the uses receive the accumulated 
benefit of the previous users. Different profile type s would 
enable the answers to be provided in an appropriate form for 
the user, e.g. novice, expert, etc. The keyword suggester 
may, for example, suggest searching with key-words 
(questions) more likely to yield a satisfactory response. 
There can be a range of answers to each question and as the 
system learns it will converge on to the best answers. 
Question and Answer Services 

Current On-line question/answer programs could be con- 
figured to run via the present invention thus enabling 
answers to repeatedly asked questions to be based on 
previous questions and similar questions to be suggested. 
- Content Optimization on Other Parts of the Internet 

The same content bidding mechanism could be used to 
determine the price of content for any location on the 
Internet, not just web page listings as identified above. In 
this embodiment, content providers will bid for a general 
content space to set the price automatically. 

The profile type information from the search engine could 
be used as a passport so that other advertisements on the 
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Internet could be more targeted to different audiences. This 
profile type information could also be used by web-page 
developers to customize their web-page for different sets of 
users. 

5 People Matching Service 

In another embodiment, the system according to the 
present invention can be used as a dating service and/or a 
method for matching people with similar preferences by 
doing a statistical analysis to compare the individual pref- 

10 erences (Table 6) of groups of users. The individual past 
preference Tables, in this embodiment, would preferably be 
normalized and compared to each other using a standard 
correlation coefficient. When compared to other users it 
would give a numerical indication of how similar their 

15 preferences are. 

The same embodiment could also be used to find infor- 
mation about similar people from there past preferences 
Tables. For example one could ask to be give the names of 
people in New Zealand with an interest in Ecological 

20 Economics and a search could be made of the personal 
preferences Tables. Such an embodiment, however, would 
typically include a password/consent indicator that provides 
consent of identified persons to give out their information, 
which consent could be given, for example, in only certain 

25 circumstances, which circumstances are limited to searchers 
who have a level of authority and password indicating the 
same, or for persons who identity themselves with certain 
characteristics. 

While the invention has been described in connection 

30 with what is presently considered to be the most practical 
and preferred embodiments, it is understood that the inven- 
tion is not limited to the disclosed embodiment For example, 
each of the features described above can be use singly or in 
combination, as set forth below in the claims, without other 

35 features described above which are patentably significant by 
themselves. Accordingly, the present invention is intended to 
cover various modifications and equivalent arrangements 
included within the spirit and scope of the appended claims. 
We claim: 

40 1. In a computer network having a plurality of user sites, 
a method of weighting the relative importance of a plurality 
of data items stored in a database on a server computer 
comprising the steps of: 

receiving at said server computer one or more keywords 
45 from user sites; 

generating at said server computer a plurality of listings 
corresponding to said keyword, each listing also cor- 
responding to one of said data items; 
5Q transmitting from said server computer to one of said user 
sites said plurality of listings; 
detecting at said server computer which ones of said 
plurality of data items are selected by said user site, 
said user site being transmitted each selected one of 
55 said data items upon selection of said corresponding 
listing by said user site; 
updating said database to weight said selected ones of said 
data items as relatively more important than unselected 
ones of said data items with respect to said keyword; 
60 wherein said step of detecting only detects each key- 
word one time from each user site during a deter- 
mined interval of time; and 
wherein said step of generating uses a history factor 
associated with each keyword in determining said 
65 plurality of listings, wherein said history factor is a 

number less than or equal to 1 and greater than or 
equal to 0. 
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2. In a computer network having a plurality of user sites 
and developer sites, a method of populating a database on a 
server computer comprising the steps of: 

entering a plurality of data items into said database from 
said developer sites, each of said data items entered into 
said database including as associated identifiers a plu- 
rality of associated keywords; 

updating said database by entering a plurality of user 
traces, each of said user traces identifying one of said 
data items and an associated keyword so that each trace 
increases the relative importance of the associated data 
item with respect to said associated keyword, wherein 
said step of updating only updates each keyword one 
time for each user during a determined interval of time; 
and 

wherein said step of updating each keyword one time 
includes the steps of: 
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associating an identifier wilh each user, wherein each of 
said user traces including the user identifier, wherein 
said user identifier is used to update a user table; and 
using said identifier to track keywords that have been 
5 entered by each user. 

3. A method according to claim 2, wherein said user table 
includes user traces corresponding to a plurality of different 
types of searches performed by the user. 
10 4. A method according to claim 3 wherein one of the types 
of searches performed by the user corresponds to keyword 
searches that have been most recently updated. 

5. A method according claim 2 wherein a particular user 
has a plurality of user identifiers, each user identifier cor- 



responding to a different user profile. 



